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This article describes the effectiveness and sustainability of teacher professional development interventions 
from a complexity viewpoint as well as a more 'standard' viewpoint. The first aim of this study is to give a 
theoretical overview of effective aspects of interventions regarding teachers' professionalizing using recent 
literature. The second aim is to re-interpret effectiveness and effectiveness studies using a complexity 
approach. The third aim is to empirically illustrate a complexity approach to the effectiveness of 
interventions using a multiple case study. We have described intervention specific aspects, teacher specific 
aspects, context specific aspects and implementation specific aspects and have shown in the cases that 
during an intervention these aspects intertwine and act as a complex dynamic system. The complexity 
approach to interventions has implications for future empirical research as well as for the design of 
interventions. In future research, results of small scale and large scale research should be combined, in 
order to obtain a better insight in all relevant aspects and their effect on teachers' behavior. Research ought 
to concentrate on the effects that all contextual aspects have intertwiningly in order to design more effective 
interventions and with better implementation processes. 


Introduction 

Children who are currently in preschool will enter the labor force around 2030, and will work 
until their retirement, around 2070. Nobody knows what the world will look like then, but it is 
likely that in order for children to be prepared for their professional career in the future, they 
must learn the skills for the 21st-century, namely critical thinking, problem solving, and 
decision making, among other skills (Binkley et al., 2012). The recognition that teachers play a 
central role in children's mastering of these skills is a necessary premise to acknowledge when 
considering interventions in education systems aimed at preparing children for the rapidly 
changing world (Ferguson, 1991; Grigg, Kelly, Gamoran, & Borman, 2012; Harwell, 2003; Hattie, 
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2003). If society aims to prepare pupils as well as possible for the future, teachers need to teach 
their pupils the latest developments in numerous areas. This also implies that, in order to keep 
pace with a rapidly changing world, teachers' professional development needs to continue after 
their formal education. However, much of the professional development that the market offers 
to teachers could be more effective than it is now (Garet et al., 2011; Gersten, Taylor, Keys, 
Rolfhus, & Newman-Gonchar, 2014; Mayer, 2004; Pressley, Graham, & Harris, 2006). The reason 
for this lack of effectiveness is not always poor design of the intervention itself, but rather the 
multitude of intertwining factors in the complex system surrounding a teacher and his [sic] 
classroom that influence the effectiveness of an intervention (Goldspink, 2007). Moreover, an 
intervention can be effective during the intervention period, but the question is whether these 
positive effects are sustainable when the interveners and the support and attention they give to 
school and teachers are no longer in place. Often the effects disappear relatively soon after the 
intervention is carried out (Han & Weiss, 2005; Noell, Witt, Gilbertson, Ranier, & Freeland, 

1997). 

In this paper we describe the effectiveness and sustainability of teacher professional 
development interventions from a complexity viewpoint as well as from a more 'standard' 
viewpoint. The first aim of this study is to give a theoretical overview of effective aspects of 
interventions regarding teachers' professionalizing using recent literature. The second aim is to 
re-interpret effectiveness and effectiveness studies using a complexity approach. The third aim 
is to empirically illustrate a complexity approach to the effectiveness of interventions using a 
multiple case study (Hetherington, 2013). 

2 Explaining the Effectiveness of Interventions Aimed at Teachers' Professional 

Development 


2.1 Professional development 

Professional development for teachers is a continuous process of professional growth (Day, 
1999; Evans, 2002; Keiny, 1994) and encompasses all types of facilitated learning opportunities 
as well as informal learning opportunities situated in practice (Day, 1999; Lieberman, 1996). It is 
a necessary process because, first, it enables teachers to learn to use the knowledge acquired in 
teacher training in practice, and, after that, it helps to keep up with developments in society. 
After initial teacher training, the first part of professional development consists of the 
experience a new teacher gains at his or her job. As early as 1938, Dewey observed that this 
increase in experience alone did not necessarily result in more sustained knowledge or 
sustained better teaching. Unsurprisingly, the learning curve of teachers rapidly declines, as 
teachers' learning stagnates after the first few years working in practice (Barth, 2004). To keep 
the learning curve constant, teachers are provided with an overwhelming choice in professional 
development activities (Hattie, 2009). However, it is likely that most training opportunities are 
not effective in creating sustained improvement of teacher effectiveness because they are not in 
line with teachers' motivational and intellectual requirements (Day, 1999; Imperley, Wilson, 
Barrar, & Fung, 2007). 
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The criteria for the effectiveness of a professional development intervention for teachers are 
ambiguous (Guskey, 2003). For this paper, we use a classification applied by Garet, Porter, 
Desimone, Birman, & Yoon (2001) and Desimone (2009). According to this classification, 
teachers' change in knowledge and attitudes, teachers' change in behavior in the classroom, and 
pupils' results are the three elements that provide indications for the intervention's 
effectiveness (Desimone, 2009; Garet et al., 2001). Achieving these goals in the long run, and 
thus providing an effective and sustainable intervention, depends not only on the intervention 
itself but also on whether the intervention has been successfully implemented (Durlak & DuPre, 
2008; Harwell, 2003; Noell & Gansle, 2009). Unfortunately, the effects of interventions tend to 
disappear relatively soon after the intervention is carried out (Han & Weiss, 2005; Noell et al., 

1997) . What, in addition to the quality of implementation, does the available literature tell us 
about the main aspects that have an influence on the effectiveness of professional development 
interventions? 

2.2 Important aspects of a professional development intervention 

Teachers are highly critical consumers of professional development activities. Day (1999) states 
that professional development for teachers "is not simply a cognitive process ... it demands 
emotional commitment. It will involve the head and the heart" (p. 47). Garet et al. (2001) and 
Desimone et al. (2009) make a distinction between various aspects that positively contribute to 
the effectiveness of an intervention. The first concerns the properties of the intervention itself, 
the intervention-specific aspects. The second includes those variables relate to the teacher as a 
person, and the third relates to the school organization within which a teacher works, i.e., the 
school context (Han & Weiss, 2005). In addition to these variables, we must distinguish aspects 
of implementation quality. 

2.2.1 Intervention specific aspects 

Interventions are characterized by three structural and three substantive characteristics 
(Armour & Yelling, 2004; Birman, Desimone, Porter, & Garet, 2000; Garet et al., 2001). Structural 
elements are form, duration and collective participation (Garet et al., 2001). As for form, 
traditional forms of education, such as workshops and external trainings without follow-up in 
the classroom, are not as effective as training within the school or classroom with a connection 
to the teaching practice (Fabiano et al., 2013; Goldenberg & Gallimore, 1991; Loucks-Horsley, 
Stiles, Mundry, Love, & Hewson, 2010). Teacher training in combination with direct 
performance feedback produces the strongest training effect (Noell, et. al.; Rose & Church, 

1998) . The duration aspect can be understood in two ways: the period of time over which the 
intervention is spread, and the number of contact hours. Literature suggests that both 
components of duration need to be long enough in order to be effective in passing on 
knowledge and in sustaining changes over time (Garet et al., 2001; Harwell, 2003), although 
schools often do not have enough time or money to undertake longer trainings. Collective 
participation, i.e., participation by a group of teachers from the same school, is also a positive 
structural aspect of interventions that allows teachers who work together to share learning 
experiences, so they can integrate their new learning experiences with their own school 
background and curriculum (Garet et al., 2001; Harwell, 2003). 
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As regards the substantive elements in an intervention, first, there must be a focus on content. 
Teachers often lack subject matter knowledge, including the related teaching practices in 
specific content domains. Together these are known as the pedagogical content knowledge 
(Shulman, 1987). When comparing interventions based on general pedagogical knowledge, or 
on content knowledge, or on pedagogical content knowledge, the literature shows more 
positive effects on students achievements when the education activities are focused on the 
pedagogical content knowledge (Cohen & Hill, 2000; Loucks-Horsley et al., 2010; Smith, 1999), 
apparently because the latter gives teachers more concrete ideas for their lessons. Secondly, the 
intervention needs to be based on active learning, by means of reflection on the way a subject 
can be taught in the classroom, such as discussion and dialogue. In this way, the participating 
teachers obtain opportunities to practice what they learn in the classroom and discuss it with a 
colleague or a coach (Harwell, 2003). Finally, coherence with other learning activities is 
necessary (Garet et al., 2001) so that something learned can be integrated with a range of 
activities within a school. 

2.2.2 Teacher specific aspects 

To begin, teachers' sense of self-efficacy is important for their level of openness to new ideas 
(Berman, McLaughlin, Bass, Pauly, & Zallman, 1977; Durlak & DuPre, 2008). In addition, 
teachers should feel the need for using the learned content in their own classroom (Harwell, 
2003), and they should feel intrinsically motivated to use the new practices in their daily 
routines because teachers' intrinsic motivation is very often the basis from which they started 
teaching (Dinham & Scott, 2004). Humans in general are basically in search for self-motivation 
and self-development since these aspects contribute to human well-being (Deci & Ryan, 2002; 
Diener, 1984). 

When performing a school intervention, it must be kept in mind that attempts toward 
changing teachers' habits and behaviors can be a threat to teachers' intrinsic motivation in the 
classroom. More specifically, it is important that teachers' basic concerns (Frijda, 1986), namely 
autonomy, relatedness and competence (Minnaert, Boekaerts, & de Brabander, 2007; Ryan & 
Deci, 2000; Steenbeek & Van Geert, 2007) should not be threatened. Autonomy plays a role in 
that teachers should feel like they can make their own decisions concerning when an 
intervention is carried out at a school. Secondly, teachers often feel like they lack the 
competence to act the way they are expected to during the intervention (Goodrum, Hackling, & 
Rennie, 2001; Jarvis & Pell, 2004; Palmer, 2002). Primary teachers may take a general degree in 
teaching; consequently they may not have content knowledge of specific domains such as 
science, which may result in feeling uncomfortable during professionalization courses aimed at 
teaching science to young children, for instance. And finally, teachers need to feel connected 
with colleagues and pupils when teaching, especially in a period when things change in their 
classroom. 

2.2.3 Context specific aspects 

Changes in teachers' behavior do not occur in the vacuum of the classroom only. Various 
political, bureaucratic and systemic factors influence teachers' adaptation to new working 
methods in schools (Adelman & Taylor, 2003; Goldspink, 2007). In this study, we shall confine 
ourselves to the school specific factors because these factors can be addressed directly during an 
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intervention. They include organizational support in training, for instance, extra time for study, 
practice and reflection on each other's practices with co-workers, as well as supervision and 
support by the school principal during the intervention's implementation and afterwards 
(Fullan, 2007; Gottfredson & Gottfredson, 2002). The school system should be involved in a way 
that, once external support from trainers or researchers ends, the system is ready to integrate 
the elements of the intervention into the existing practices. All levels of an organization, i.e., the 
classroom level, the teacher team level, and the level of the school board should work together 
in order to create sustainable change (Coburn, 2003; Goldspink, 2007). Shared decision making 
regarding the intervention is an aspect that adds to a better implementation (Durlak & DuPre, 
2008). If the intervention focuses on a group of teachers from the same school, it can promote 
teachers to work together, share mutual experiences, and find solutions to joint difficulties in 
the new way of working (Loucks-Horsley et al., 2010). 

2.2.4 Aspects regarding implementation 

As Boomstra (2014) describes, implementation (i.e., realizing the intervention) is an important 
aspect to consider when assessing the effectiveness of an intervention. The theoretical basis of a 
an intervention can be adequate, but the effectiveness can be negatively influenced by less 
sufficient execution of the intervention, or even by intervention activities that were never 
prescribed. Gearing et al. (2011) described four elements that contribute to minimizing the 
distance between the theoretical basis of an intervention and the delivery of components of the 
intervention in real practice: 1) good intervention design and protocols, 2) good intervention 
training, 3) sufficient monitoring of intervention delivery, and 4) sufficient monitoring of 
intervention receipt. These four components also need to be taken into account when designing 
an effective intervention. In addition, treatment integrity is an important aspect that greatly 
contributes to successful interventions. Treatment integrity is the extent to which an 
intervention is implemented as it was designed. When an intervention is not provided by 
trainers as intended, whatever effects occur will not be due to the intervention as such (Noell, 
Gresham, & Gansle, 2002). Documentation of the intervention, for example, can assure 
treatment adherence, whereas training of the trainers according to the documentation can 
assure their competence in providing the intervention as intended (Perepletchikova & Kazdin, 
2005). 

In fact, models that describe the effectiveness and sustainability of teacher professional 
development interventions from a standard viewpoint assume that a linear addition of all these 
factors can explain the effectiveness of an intervention in general. This might be true if a model 
is viewed as a purely statistical statement. That is to say, if the individual and temporal aspects 
of the intervention process are not taken into account, the effectiveness of an intervention across 
contexts and occasions can indeed be "explained" on the basis of how it combines and 
implements the elements discussed in the preceding section. However, the actual performing of 
an intervention in a concrete context cannot be modelled as a sum of the variables (Molenaar & 
Campbell, 2009). First of all, the actual performing of an intervention in a concrete context is a 
process, taking place in time. Second, it is a process, taking place in a particular context, with 
specific individuals and people who carry out the treatment. Third, it is a process taking place 
in a complex system, namely, a system of intertwining teacher, child, school, cultural and 
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community variables. It is highly likely that the statistical model emerging from sample-based 
effectiveness studies - which is a linear additive model — does not apply to the individual and 
complex processes of performing particular interventions. The point discussed in the next 
section is that understanding intervention effectiveness requires a complexity point of view in 
order to capture the temporal and idiosyncratic aspects of interventions. Nonetheless, it is 
important to make a distinction between designing an intervention, and performing an 
intervention. While designing an intervention, one can use the standard statistical model of 
effective components as a partial guideline. However, this is only a partial guideline in the 
sense that the designer of the intervention should be aware that performing the intervention 
will take place in a complex dynamic system with many intertwined and dynamically changing 
processes that are likely to be highly idiosyncratic, and in which the researcher inevitably 
becomes an embedded component, even if the researcher is not carrying out the intervention 
him or herself (Van Geert & Steenbeek, 2014). In the next section, we shall try to redefine 
teachers' professional development using a complexity approach. 

2.3 Teachers' professional development: a complexity approach 

Looking at professional development as a simple cause-and-effect process does not do justice to 
the complexity of functioning as a teacher in a specific school context, and to the complex world 
in which teachers function (Goldspink, 2007; Opfer & Pedder, 2011). Complexity research 
reasons that various "forces" within a person and within society act simultaneously and 
mutually to contribute (or not) to behavioral change. In this way, these "forces" contribute to 
the extent that teachers learn and develop as a result of an intervention, and to all the processes 
that the intervention brings about during its deployment and afterwards. The individual 
teacher and his or her classroom behavior can be seen as such a complex dynamic system in 
itself. 

A complex dynamic system is a system consisting of a number 1 of interacting elements or 
components that change each other's properties and consequently show self-organization, the 
emergence of self-sustaining states and patterns of states (Goldspink, 2007; Jorg, 2011; Van 
Geert & Steenbeek, 2010). This means that various influences occur that become coordinated 
and that explain the changes and stabilities in the individual teacher's learning process over 
time (Steenbeek & Van Geert, 2013). These influences are very diverse, and range from the 
teacher's own previous experiences with activities aimed at professional development to a 
teacher's work load, the external pressure a teacher experiences in the form of inspection 
reports and standardized school performance tests, and the importance a teacher assigns to the 
content to be learned. In short, the teachers' professional learning emerges out of the teacher- 
context dynamics. 


1 The number of interacting elements needed to constitute a complex system typically depends on the 
nature of the interactions. For instance, a system consisting of no more than three celestial bodies exerting 
gravitational forces onto each other already shows complex behavior. In the educational sciences, 
complex systems typically consist of one to a few dozen interacting elements. Macro scale social 
phenomena such as economics involve the interactions of many millions of producers and consumers. 
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Most current research in developmental psychology recognizes the importance of studying 
human learning and development in context (Van Geert, 2009). However, this context is usually 
seen as an independent and distal variable, e.g. as in studying the statistical association between 
the kind of school and the academic performance of children (Van Geert & Steenbeek, 2005). 
Using a complexity approach perspective, it is important to examine the context as a proximal, 
dynamic factor, meaning that the context is continuously created by the actions of the 
participants, and that the actions of the participants are continuously dependent on the created 
context. According to the complexity approach, context and person influence each other, as 
there is a "reciprocal determinism" between the two (Steenbeek & Van Geert, 2007). In this case, 
the school context in which the teacher functions must not be considered as a stable context, but 
as dynamic one, which changes itself over time in association with the changes in the teacher(s) 
and other components of the student in the school system. 

An intervention or activity aimed at improving the teacher's behavior can be regarded as an 
important component that changes the context in which the teacher functions, and acts as a kind 
of "perturbation" of the complex dynamic system of school-student-teacher relationships. Note 
perturbing a complex system does not necessarily imply that the system will change. In fact, 
complex dynamic systems tend to retain their stability by counteracting such perturbations in 
order to damp out its effects. Whether or not a perturbation will indeed lead to a change in a 
complex system depends on the properties of the perturbation and the conditions under which 
it happens. Given the mutuality of person and context, it is clear that the exact way in which a 
particular intervention (perturbation) unfolds is not independent of the reactions of the teacher 
to the intervention, or of the teachers' input in the intervention. The teacher's 'simplex system', 
i.e., the teacher's connected set of beliefs, representations, values, emotions, habits, practices 
and material tools plays a central role in this process. This 'simplex system' serves as a 
simplifying representation of the overarching complex system in which a person participates, 
and organizes the participants' actions (for more information about the notion of 'simplex 
systems', we refer to Van Geert & Steenbeek (2014)). 

That is to say, even if we guard the faithful implementation of a particular intervention, its 
concrete form over time will unavoidably be influenced by the teacher(s) and by the effect the 
intervention has on the teacher(s) and other significant parts of the school context. At the same 
time, the intervention changes the teacher as a learning individual as well. In this process of 
change, it is important to examine the fit between a specific intervention with a specific teacher 
in a specific context with a specific provider of the intervention. This fit is not only an 
idiosyncratic process, which means that the outcome is specific for this specific individual (or 
specific team of teachers), but also a dynamic process, in that it changes as a result of the 
interactions. 

With regard to the effect of interventions for teachers in a specific school context, this 
means that the (above described) intervention-specific aspects, teacher-specific aspects, context- 
specific aspects and aspects regarding implementation interact and combine in various ways, in 
such a way that a particular intervention can have multiple outcomes, dependent on the 
dynamics of the specific context-teacher relationship. The fact that average results can be seen 
for a particular sample or population of individuals does not imply that this average result is 
true for each member of the sample or population (Jarvis & Pell, 2004; Rose, Rouhani, & Fischer, 
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2013). In order to understand how and to what extent interventions work, they should be 
studied in the context of actual intervention processes taking place in individual teachers 
(Barlow & Nock, 2009; Molenaar & Campbell, 2009). In fact, what counts is the way in which the 
teacher assimilates something that is in principle a perturbation of the ongoing daily process, 
namely the intervention. The assimilation can take several dynamic forms. For instance, it may 
be the starting point for a positive change cycle. It may also cause resistance and changes that 
actually compensate for the attempt to change the current attractor state, etc. Thus, from a 
complexity viewpoint, implementation is the effect that the introduction of the perturbation (in 
this case the intervention) has on the current state of the 'teacher's system'. 

As can be seen in figure 2, a reciprocal influence exists between a teacher and every single 
group of aspects that plays a role in the intervention. This figure illustrates how a teacher and 
all aspects influence each other during the intervention. The arrow shows how the teacher and 
all aspects influence each other. This figure stresses the importance of the fit between a teacher 
and each single group of aspects in order for an intervention to be (more or less) effective for a 
particular teacher. Secondly, the figure stresses the fact that the influences exist between the 
teacher and all the different aspects at the same time, and that the influences are not one- 
directional, but multi-directional. 

Thirdly, Figure 0 shows that, when describing a concrete, locally situated intervention in a 
complex dynamic system, it is difficult to explain the effective aspects in isolation. In a complex 
dynamic system, these groups of aspects, for instance, context based and teacher specific aspects 
are all intertwined during the actual process of an intervention, and the effectiveness for an 
individual teacher dynamically emerges from this interaction. 

This implicates that, when conducting an intervention study, researchers should take into 
account that the separate aspects are important, but also that all these aspects are intertwined in 
a real world situation. This means that the combination of both approaches is meaningful. On 
the one hand, the standard sample-based average effect of the intervention with various aspects 
as largely independent variables contributing linearly to the statistical group effect can inform 
us about the net effect of an intervention for a particular group or population, and is typically 
important for the level of policymaking. On the other hand, the case-based dynamic 
understanding of how the actual intervention process leads to a particular result for a particular 
person (or for a particular school or team of teachers) is typically important for the level of 
actual intervening, i.e., this approach provides information about how actual interventions can 
be carried out and adapted within the limits of implementation integrity. In the third main 
section of this paper, we shall now illustrate the importance of both approaches by describing a 
multiple case study, in which two primary school teachers participated in an intervention 
aimed at teachers' professional development, especially with regard to improving the quality of 
their science lessons. 
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Intervention 



Figure 0. An intervention using a complexity approach 


3 Empirical Illustration: The Design of a Science Teaching Intervention and its 

Effects, Using a Complexity Approach 

3.1 Theoretical background 

Various studies have shown that an important determinant for young students' success in 
science education is the students' curiosity, a quality that young children start with naturally 
(Jirout & Klahr, 2012; Loewenstein, 1994). However, once young children go to school, this 
curiosity seems to vanish. Engel (2009) observed that teachers hardly encourage (and sometimes 
even discourage (Martin-Hansen, (2009)) inquiry and curiosity in the classroom. This lack of 
stimulation might explain the diminishing interest in learning science in the first school years of 
children (Osborne, Simon, & Collins, 2003). This finding implies that it is important to start from 
the early school years on with encouraging children's natural curiosity and scientific reasoning, 
and thus, with acquiring positive science experiences. Teachers play a significant role in 
providing these positive experiences, and therefore it is important to design and implement 
professional development trajectories aimed at improving teacher's knowledge, skills, and 
attitudes in teaching science. 
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In our study, the goal was to design an intervention for improving the quality of teachers' 
science teaching, based on principles from (School) Video Interaction Guidance (Kennedy, 
Landor, & Todd, 2011**; Van den Heijkant et al., 2004). The aim of the intervention was to 
enable lower grade teachers, without preliminary experience in teaching science, to teach 
science in their own classroom by carrying out small scientific experiments. Our hypothesis was 
that in the concrete situation of each particular teacher, the effective aspects would work 
together in a specific way, and in fact amplify each other to produce a specific effect of the 
intervention. We also assumed that these processes of self-amplification or self-reduction are 
likely to be highly idiosyncratic, and thus will show high inter-individual variability. We 
expected that there would be an effect of the intervention in each particular teacher, but that 
this effect would greatly differ between the teachers due to the idiographic process, i.e., specific 
interplay of the intertwining aspects within each teacher. 

3.2 The content of the intervention 

When developing the intervention, we started from the following basic assumptions. The first 
assumption is that children are curious about their environment. The second assumption is that 
teachers have some knowledge of basic science principles, for instance, of floating and sinking 
and gravity 2 . This implies that it should be possible for teachers to start with simple 
experiments in the classroom that do not require specific teacher training. The emphasis of the 
intervention was not on scientific content knowledge, but on pedagogical content knowledge in 
science lessons (Smith, 1999). Teachers needed to learn and experience how to enhance 
children's curiosity and scientific reasoning during these lessons. Therefore, teachers received 
as part of the intervention information they were not familiar with beforehand about 
questioning (Oliveira, 2010), about the empirical cycle, and about scaffolding., in a two hour 
theory session. Scaffolding (Granott et al., 2002; Mattanah, Pratt, Cowan, & Cowan, 2005; Van 
Geert & Steenbeek, 2005) was explained as a way of helping pupils to reach a higher level than 
they could reach on their own by being given parts of the solution to a problem. The aim of 
scaffolding is that pupils ultimately can give solutions independent of a teacher's help. The 
empirical cycle (De Groot, 1961) was explained as a five step inquiry scheme to provide 
structure to a science lesson. The empirical cycle, as applied to what young pupils can do in a 
science lesson, consists of the following steps: first asking your research question (for example 
why does a paperclip sink whereas a big piece of wood floats) second hypothesizing (for 
example, we think that the paperclip sinks because it is made out of iron), third designing a 
research model (for example, we take several iron objects, including a small metal box, and we 
predict that they will all sink), fourth observing (for example, try each metal object separately 
and observe whether it sinks are not), explaining the observed result (for example, because 
some metal objects do not like water, they will float). The cycle can then be repeated by taking 
the last step as its starting point, for example, the pupils ask how they could know whether 


2 It should be noted that teachers, and many other educated adults for that matter, can hold serious 
misconceptions about physical science concepts, such as those involving Archimedes law. However the 
basic science principles involved in the current science lessons were truly basic and intuitive, and none of 
the teachers showed any misconceptions on this level. 
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certain metal objects like or dislike water. In the current example, the explanation is not correct 
from a scientific point of view, and repeating the empirical cycle, supported by questions asked 
by the teacher, should help children to move towards better explanations and better predictions 

After the theory lesson, teachers prepared a science lesson four times to be given in their 
own classroom with a small group of children. Each lesson was recorded on video. Afterwards 
the coach selected relevant video fragments with a focus on teacher questioning, the use of the 
empirical cycle, and scaffolding in the interaction with the children. These fragments were 
reflected upon by teacher and coach together using the principles of Video Enhanced Reflective 
Practice (VERP) (Kennedy et al., 2011). The focus of the coaching sessions was to enable the 
teacher to discover in interaction with the coach, how to change behavior by discussing 
fragments that were representative of the teacher's behavior. 

3.3 The design of the intervention 

Important aspects of interventions that contribute to the effectiveness of an intervention and the 
way these aspects can best be addressed were specified above. The next sections describe how 
these aspects were incorporated in our science teaching intervention. 

3.3.1 Intervention specific aspects 

The form of the intervention was designed as training within the classroom with elements that 
have proven to be effective, as there is a direct connection between the teaching practice 
(Loucks-Horsley et al., 2010) and the use of direct performance feedback (Noell et al., 1997). The 
duration of the intervention was defined by the total period of time in which the intervention 
sessions were given and the number of contact hours. The intervention lasted five to eight 
weeks. Every coaching session consisted of a science lesson of 20 to 30 minutes in the teacher's 
own classroom, followed by 30-45 minutes of coaching. This duration was defined by a 
combination of time and cost effectiveness. Because of the importance of collective 
participation, the aim was to involve most teachers of the lower grades to participate. In the 
actual intervention, we had three participants in one school, and four participants in the other 
school, which gave the teachers the possibility to share their experiences. 

The focus in the intervention was on improving teachers' pedagogical content knowledge, 
by means of active learning. That is, during the four lessons with subsequent coaching, teachers 
were given the possibility to put the learned framework into practice and to reflect on their own 
lesson with the help of a trained coach as described in 3.2. Moreover, the focus was on 
coherence with other learning activities. This aspect was not incorporated in the design of the 
intervention, although schools usually volunteered to participate because they viewed science 
as an important topic in the school curriculum. Thus, a broader interest in the topic implicated 
that more activities regarding science lessons are already being undertaken at the school, 
although not necessarily in coherence with other learning activities. 

3.3.2 Teacher specific aspects 

The teacher specific aspects were addressed by taking teachers' concerns, (autonomy, 
relatedness and competence) into account, so that teachers could feel more intrinsically 
motivated to use the newly learned content. 
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Firstly, the teachers' competence concern was addressed by presenting the teachers with an 
interactive lecture on the knowledge they need when teaching science. The aim was to let 
teachers experience that they already possessed enough content knowledge to teach science to 
young children, e.g. about floating and sinking, gravity and air pressure experiments. 
Furthermore, during the video coaching, the coach addressed not only learning points, but also 
showed the teachers video shots in which the teacher showed very competent behavior. 

The autonomy concern was addressed by letting teachers voluntarily participate in the 
intervention. Schools were asked to include only teachers who were enthusiastic about teaching 
science and who wanted to learn how to improve their teaching skills at this topic. Furthermore, 
participating teachers were asked to teach science lessons of their own choice in their classroom 
during the trajectory. Moreover, teachers were asked to formulate their own learning goal, as 
autonomous goal setting allows them to be responsible for their own learning trajectory and 
leads to better results in the process of behavioral change (Koestner, 2008). 

As for the relatedness concern, we asked schools to let more than one teacher of one school 
team participate, so that these teachers could experience and discuss the theory session 
together. In addition, the improved questioning and reacting to pupils contributed to the 
relationship between teacher and pupil. 

3.3.3 Context specific aspects 

Only school specific factors were included in the design of the intervention because they could 
be addressed directly. Organizational support in training, for instance, extra time for study and 
practice, and supervision and support by the school principal during the intervention's 
implementation and afterwards, were important activities within the design. School principals 
were asked to endorse the starting points of the intervention, agree with the time the 
intervention would take, and to support their teachers by providing substitute teachers in the 
class during the coaching sessions. The principal and the colleagues who did not participate in 
the intervention also had the opportunity to attend the theory sessions, so that all teaching staff 
had the same level of knowledge, even though not everyone participated in the overall coaching 
program. In school A, three teachers participated in the coaching program, and in school B four 
did, which made the sharing of mutual experiences possible. 

3.3.4 Aspects regarding implementation 

The intervention as a whole was clearly documented. A manual of the intervention was made 
available to enable dissemination of the intervention to other trainers and to ensure treatment 
integrity (Wetzels, Steenbeek, & Fraiquin, 2011). 

4 Studying the Effect of the Intervention Using a Complexity Approach 

The following multiple case study is part of the effect study that was conducted in order to 
examine the effectiveness of Video Feedback Coaching for Teachers (VFC-T), the coaching 
program for teachers supporting the teaching of science in the lower grades. The aim of this 
case study was to illustrate how, in the concrete situation of a particular teacher, the effective 
aspects work together in a specific way, and in fact either amplify each other or reduce the 
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influence of the intertwined aspects to produce a specific effect of the intervention. We also 
assume that these processes of self-amplification or self-reduction are likely to be highly 
idiosyncratic, and thus will cause high inter-individual variability between teachers. 

4.1 Participants 

Two teachers participated: Laura (age 46, teaching experience 23 years) from school A and Anne 
(age 53, teaching experience 22 years) from school B. Both teachers were representative of their 
co-workers in the same school as classroom teachers of pre-schoolers (age 4-6), and had almost 
the same knowledge of science and science education as well as experience in teaching science. 
The selection was based on information we collected with regard to age, knowledge of science 
teaching and experience in teaching science before the intervention. For the coaching trajectory 
they each chose four children from their class, children they considered representative for the 
whole class with regard to their age, performance level, and ethnic background. The pupils' age 
ranged from four to six years, and none of them were from an ethnic or cultural minority 
because in the part of the country where the study was carried out the number of children of 
ethnic or cultural minorities is relatively small. 

4.2 Data collection, variables and analyses 

The study uses a mixed method, combining quantitative and qualitative analyses (Creswell & 
Plano Clark, 2011). Data were collected by using video recordings of the classroom activities, 
which were captured with two digital camcorders, one focusing on the teacher, and the other 
one on the participating pupils. Afterwards, the video recordings were coded, using a coding 
scheme for the variables "amount of questions" and "level of reasoning". 

Two coding schemes were developed for coding teachers' (Wetzels, Steenbeek, & Van 
Geert, 2015) and pupils' utterances (Wetzels, Steenbeek, & van Geert, 2015). The coding scheme 
regarding teachers' utterances was operationalized as the variable 'teachers' scientific reasoning 
eliciting questioning' (TSEQ), and it was developed using literature that stresses the importance 
of questions regarding the empirical cycle for science learning (De Groot, 1961; Dejonckheere, 
Van De Keere, & Mestdagh, 2009; Engel & Randall, 2008; Oliveira, 2010). All questions were 
selected and coded, if possible, in terms of the steps of the empirical cycle (see section 3.2): 
'knowledge questions', 'prediction questions', research design questions', 'observational 
questions' and 'questions about the explanation'. Additional categories of questions were: 
'follow up questions', 'other questions relating to the content', and 'other questions'. 

The coding scheme (table 1) regarding pupils' utterances was operationalized as the 
variable 'pupils' level of reasoning' (from now on PLS), and is based on skill theory (Fischer & 
Bidell, 2006). It describes the cognitive level in pupils' utterances on an interval scale, according 
to their skill level. For each lesson, which lasted about 20-30 minutes, the highest level one of 
the pupils reached was coded per minute, and subsequently the mean score for each lesson was 
calculated. When children said nothing during a minute, it was coded as zero because in this 
case the teacher did not elicit children's reasoning at all. At the least complex skill level we 
coded a 1. A 2- 5 code indicated increasing skills. (Meindertsma, Van Dijk, Steenbeek, & Van 
Geert, 2012; Rappolt-schlichtmann, Tenenbaum, Koepke, & Fischer, 2007) 
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Table 1: coding scheme for pupils' level of scientific reasoning (PLS) 


Code 

Skill level 

Description 

Example 

0 


No answer is elicited by 
the teacher 


1 

Sensorimotor 

A child observes 

This is a blue pencil 


mapping 

characteristics of an 
object 


2 

Sensorimotor 

Child states a 

The pencil floats because you 


systems 

relationship between 
action and result 

placed it in the water 

3 

Single 

Child refers to one part 

The pencil floats because it is 


representation 

of the explaining 
mechanism 

small 

4 

Representational 

Child refers to two or 

The pencil floats because it is 


mapping 

more parts of the 
explaining mechanism 

small and light 

5 

Representational 

Child refers to all 

The pencil floats because it is 


system 

explaining mechanisms 

light for its size in the water 


Differences between pre and post-intervention and pre-intervention and intervention 
lessons for each individual case were analysed using Monte Carlo permutation analyses 
(Todman & Dugard, 2001), a non-parametric permutation test, particularly suitable for small 
sample sizes. The probability that results are caused by chance alone was estimated by 
simulating the chance-alone condition. This simulation is done by randomly shuffling all results 
numerous times (1000 or more). The original result is then compared with the distribution of the 
results obtained by the random model, using a p-value of .05, to determine the probability of 
whether the observed result for each particular teacher might be due to chance alone. 

After the last post-intervention, the participants were interviewed by using a semi- 
structured interview technique. Questions focused on the way the teachers had experienced the 
coaching trajectory and on the changes the teachers observed regarding their own behavior and 
regarding the children in the classroom. 

4.3 Results 

4.3.1 Quantitative findings - TSEQ and PLS 

The results of Laura (as represented in Ligure la) show that the amount of TSEQ increased 
significantly after the two pre-intervention lessons and the theory session. This higher amount 
of TSEQ stayed at a higher level during the four coaching sessions (p=0.02). However, it 
dropped sharply two months after the coaching sessions, during the post-intervention lessons, 
which implies that no statistically significant results were seen in the longer term (p= 0.50). With 
regard to the variable PLS (Figure lb), a significant increase was found during the coaching 
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lessons (p= 0.02) in comparison with the two pre-intervention lessons, and this higher level was 
continued during the post-intervention lessons (p=0.03). 

On the contrary, with Anne (Figures 2a and 2b), no significant changes occurred during the four 
coaching sessions, nor two months after the coaching sessions, during the post-intervention 
lesson, both regarding the amount of TSEQ (short term p=0.87, long term p= 0.69) and the PLS 
(short term p= 0.89, long term p=0.77). 


Pretests 


Posttests 


160 

140 

120 

100 

80 

60 

40 



12345678 
la. Amount of emprical cycle questions 



Laura 

Pretests Posttests 

4 

3 

2 

1 

0 

lb. Mean level of reasoning pupils 









.♦ 





- 

♦. 4 




12345678 


Anne 



2a. Amount of emprical cycle questions 


2b. Mean level of reasoning pupils 


Figure la, lb, 2a and 2b. Results Laura and Anne 


In the interview, both teachers were positive about the form of the intervention in that 
they gave the intervention a very positive rating (Anne gave a 7, Laura gave a 8 on a scale of 1- 
10). However, the quantitative variables show a high amount of inter-individual variability 
with regard to the teachers' questions, and the pupils' 'level of reasoning'. Presumably related 
to the differences between the teachers, Anne shows a higher amount of intra-individual 
variability than Laura. 

4.3.2 Comparison of the two trajectories regarding the influencing aspects and the intertwining 
between them 

In the following we qualitatively describe (by giving examples) how all aspects have an 
influence on the effectiveness of the intervention in the multiple case study, not only alongside 
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each other, but also intertwiningly influencing each other, using the interview data of both 
teachers. 

The science teaching intervention was designed with several effective aspects incorporated 
(c/ the design of intervention section above). The intervention-specific aspects, both structural 
and substantive, were the same for both teachers, as well as the treatment integrity, as the coach 
was the same for both teachers and used the same documentation. 

Some examples of differences between the two trajectories can be clearly observed: for 
instance, the difference regarding the fit between school and intervention. Laura recognized 
that the intervention fit extremely well with the cornerstones of her school's teaching, namely 
the Reggio Emilia approach (Edwards, Gandini, & Forman, 1993). One of the aspects in the 
coaching program was scaffolding, requiring a fit with the level of the child and providing 
specific attention to each individual child. This manner of working with specific attention for 
individual children is a key feature of the Reggio approach, and Laura was happy to see how 
both merged together. That is to say, important content aspects of the school background and of 
the intervention were clearly consistent with one another, which might have resulted in 
amplifying Laura's enthusiasm. On the contrary, in Anne's school, science was an important 
subject; however, most teachers in that school were not required to participate in the science 
program themselves, since the responsibility was given to a specially appointed science teacher, 
which likely led to self-reduction. That is, Anne felt like the coaching did not help her with the 
regular teaching, probably did not see any result that she considered valuable, and most likely, 
for this reason, concluded that the coaching required too much of her time, as she repeatedly 
said during the interview. 

Regarding the teacher specific aspects, clear differences between the teachers could be 
seen with regard to feelings of competence and autonomy. During the pre-intervention lessons, 
Laura did not feel competent in teaching science, but she wanted to gain more knowledge and 
skills on this subject. She was very critical at the start of the theory session, but she became 
increasingly enthusiastic, and she understood the advantages of teaching science in this 
challenging way. On the contrary, Anne had some experience with science education, but 
primarily with the science coordinator giving science lessons in her classroom. During the 
theory lesson she mentioned that she wanted to help with the research, but that she felt she 
already knew enough about science teaching, although she had never put this knowledge in 
practice herself. This indicated a clear difference in the fit between the intervention and the 
teacher's simplex systems (see: Van Geert & Steenbeek, 2014). 

Regarding their feeling of autonomy, it was clear that Laura was positive about the 
opportunity to participate: "Our principal told us about the opportunity. And the topic seemed 
very interesting to me and my colleagues, so we said yes". In contrast, Anne's answer was: "Oh, 
we had to do this. All the other colleagues said no, and someone had to participate. And then I 
said yes, but it took much more time than I expected". In fact, it seems that Laura's decision was 
based on the content of the program, and enhanced her feeling of autonomy, whereas Anne 
decision was not content-based, and even seemed to threaten her feeling of autonomy. 

Furthermore, both teachers set their own learning goals. During the interview, Laura 
said: "the most important learning goal for me was to hold myself away from talking and 
answering, and give the children room for their answers. And I do much better at this now." 
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She could still recall her learning goal two months later, and could describe what she had 
learned. Anne could not recall what her goal had been. In her own word: "it was such a long 
time ago". Stated in terms of concerns, Laura seems to address her basic concerns through the 
coaching. On the contrary, Anne very clearly explained that the coaching did not address any of 
the concerns she finds interesting, in the sense that she complained about the amount of time it 
required, without identifying any feeling of positive results (the concern that she expressed was 
an external concern, namely the willingness to participate in scientific research, not the concern 
of changing her own competence) (Steenbeek & Van Geert, 2013). 

Regarding the school-specific aspects, two aspects show clear differences between Laura 
and Anne, namely the nature and extent of the school support and the school wide approach. In 
Laura's school, the principal was involved in the process of teaching science. The principal 
wanted the school to participate in the intervention research because she intended to start with 
a few teachers in the lower grades, and to eventually extend this to the rest of the teachers in the 
school. To express her commitment and make this observable within the organization, she 
participated in the theory session and in the discussion on teaching science that emerged during 
that session. She ensured that teachers would have time to participate and that their pupils 
would be taken care of during the intervention time. Laura and her co-workers felt supported 
and they understood how important the intervention was for the future of the school. In 
contrast, in Anne's school, the teachers participated in the research project because the science 
coordinator asked them to. The school principal did not know about it, and during the theory 
session the teachers participated alone, without the science coordinator or the principal. During 
the coaching sessions, Anne's replacement in the classroom was taken care of, however Anne 
felt the trajectory took too much of their time. 

Regarding the school-wide approach, the general idea at both schools was that in the end 
all teachers would learn how to teach science. In Laura's school, all four lower grade teachers 
participated, because the principal had asked them to be an example for the upper grade 
teachers, who would follow the trajectory at a later time. The principal had asked them in 
advance to think of a way to communicate the learned content to their upper grade colleagues 
at the end of the coaching program in order to inspire them too for the science teaching 
program. In Anne's school on the contrary, only three of six lower grade teachers participated, 
and no plans were made regarding whole school dissemination. 

In the previous section, the intertwining of different aspects shows that in the case of Laura, 
she was enthusiastic about the intervention because its goals fit with the school's Reggio Emilia 
approach, the principal was enthusiastic, as well as her co-workers. Her personal concerns, 
school specific aspects and intervention specific aspects worked intertwiningly together in a 
positive spiral to produce a relatively positive result. However, in the case of Anne, the 
intertwining of aspects unravelled very differently, i.e., she was not enthusiastic nor was her 
principal, which led to a relatively negative result. 

5. Discussion 
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The aim of this article was to give a theoretical overview of effective aspects of interventions 
regarding teachers' professionalizing, to re-interpret the effectiveness of these aspects from a 
complexity view, and to illustrate this complexity view with a multiple case study. 

In principle, the results presented in our case studies could be interpreted in terms of 
simple, linear additions of independent variables, such as variability in teacher ability, in the 
student's intelligence, variation in motivation, variation in students' experience and so forth. 

The complexity view, on the other hand, provides an interpretive framework focusing on the 
reciprocal influences of all these, and other factors and components. In these two case studies, 
we have illustrated how intervention specific aspects, teacher specific aspects, context specific 
aspects and implementation specific aspects intertwine, and act as a complex dynamic system, 
that is to say as a system of interacting components, leading to self-organizing patterns of 
asking questions, giving explanations, involvement in the task, the teachers willingness and 
effort to improve his or her ability to teach science, and so forth. From a complexity view, we 
focus on an interaction-dominant analysis (Van Orden, Holden, & Turvey, 2003) of intervention 
effects, which entails the analysis of how these adjacent processes interact and change each 
other's dynamics, and in which each interaction pays attention to the specific conditions of the 
adjacent processes. In this study, this means that we first looked at how the intervention works 
at the level of real-time activities of individual teachers in the context of their schools, and 
subsequently we looked at the dynamic interplay between variables that matter at this level of 
analysis, such as individual concerns and individual evaluations of the value of the obtained 
effects. In this way, an intervention can be seen as a process, a sequence of activities of a teacher, 
her children, and an intervener. How plausible is it that such a sequence of activities in which 
the teacher, children and intervener react to each other in a variety of ways can be explained by 
a superposition of influences exerted by many independently operating variables? In our view, 
such an explanation is highly implausible: it does not allow us to interpret the observable 
interactions that take place during these processes. The alternative theory - and most likely the 
only alternative present — for interpreting processes of change in teacher and pupil activities 
intertwined is a theory that treats such processes as examples of interactions between 
components, as conceptualized in the general theory of complex dynamic systems. 

In large scale research it is impossible to pay attention to all the influences that can play a 
role in an intervention process. From a complexity perspective the description of all aspects is 
not possible because of the multitude of possible aspects, and because these aspects 
continuously change over time. It is important, however, to recognize that all contextual aspects 
influence each other intertwiningly and form positive or negative feedback loops. These effects 
need to be taken into account when assessing the effectiveness of an intervention, or when 
providing an intervention. Moreover, when designing an intervention from a complexity 
perspective, attention must be paid to the fit between intervention, teacher, provider, and the 
context in the broadest sense. The notion of the intertwining and reciprocal relations between all 
aspects requires flexibility and an open mind of the trainer. The trainer himself is in a complex 
dynamic system, embedded in the process itself, and therefore does not function as an external 
factor. While maintaining the intervention integrity, the trainer must be able to make 
adaptations at the right time and at the right place, by focusing on the intervention as an 
idiosyncratic, individual-based process. We speculate that the example at the end of the 
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previous section could have worked out very differently if the principal had been very positive 
about the intervention. If she or he had shown this enthusiasm by participating in the theory 
session, Anne's concerns could have changed her motivation to more intrinsic motivation for 
participating, instead of the extrinsic motivation she had. This change into intrinsic motivation 
could also have had its influence on her idea about the learning goals and the length of the 
intervention. The results and the effectiveness of the intervention could have displayed a more 
positive picture for Anne in this situation. 

Some limitations of this research need to be mentioned. Firstly, only a part of the 
influencing context is discussed. More aspects are important when studying an intervention's 
effectiveness (Boelhouwer, 2013; Boomstra, 2014), and these aspects can vary from the 
enthusiasm of the children, to bureaucracy that enforces teachers to unwanted work, and to the 
influence of a teacher's sick child at home. An exhaustive description, however, of the 
influencing aspects is not necessary in a complex dynamic system because what matters are the 
intertwining processes and the interactions between the influencing aspects. Secondly, the 
empirical illustration is limited to two cases, in which not all possible aspects are visible, and 
therefore cannot be taken into account. However, the results of this study show the importance 
of a complexity approach when intervening. 

Designing and implementing a sustainable intervention for professional development is 
not an easy task, not least because most systems resist change in the long run. This kind of 
resistance can only be understood from a complexity viewpoint concerning the role of 
interventions, in that all aspects are seen as components that sustain each other (Frenken, 2006). 
It can take up to three to five years for teachers to fully implement a new practice or program, 
and therefore expecting change in a short period of time is unrealistic (Loucks-Horsley et al., 
2010). This can be regarded as a plea for long-term intervention processes, as well as 
longitudinal studies in order to examine the effects of interventions over a longer timeframe. 

The complexity approach to interventions has some important implications for future 
empirical research, as well as for the design of interventions. In future research, results of small 
scale and large scale research should be combined, in order to obtain a better insight into all 
relevant aspects and their effect on teachers' behavior. In addition, research should concentrate 
on the effects that all contextual aspects have intertwiningly. Once more knowledge is available 
concerning how aspects influence each other contextually, this knowledge can be used to design 
more effective interventions and to implement them more effectively with higher chances that 
the system incorporates the new influence and that effects will last. 
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